Density estimation with non–parametric methods
نویسندگان
چکیده
One key issue in several astrophysical problems is the evaluation of the density probability function underlying an observational discrete data set. We here review two non-parametric density estimators which recently appeared in the astrophysical literature, namely the adaptive kernel density estimator and the Maximum Penalized Likelihood technique, and describe another method based on the wavelet transform. The efficiency of these estimators is tested by using extensive numerical simulations in the one-dimensional case. The results are in good agreement with theoretical functions and the three methods appear to yield consistent estimates. However, the Maximum Penalized Likelihood suffers from a lack of resolution and high computational cost due to its dependency on a minimization algorithm. The small differences between kernel and wavelet estimates are mainly explained by the ability of the wavelet method to take into account local gaps in the data distribution. This new approach is very promising, since smaller structures superimposed onto a larger one are detected only by this technique, especially when small samples are investigated. Thus, wavelet solutions appear to be better suited for subclustering studies. Nevertheless, kernel estimates seem more robust and are reliable solutions although some small-scale details can be missed. In order to check these estimators with respect to previous studies, two galaxy redshift samples, related to the galaxy cluster A3526 and to the Corona Borealis region, have been analyzed. In both these cases claims for bimodality are confirmed at a high confidence level.
منابع مشابه
تخمین احتمال بزرگی زمینلغزشهای رخداده در حوزه آبخیز پیوهژن (استان خراسان رضوی)
Knowing the number, area, and frequency of landslides occurred in each area has a prominent role in the long-term evolution of area dominated by landslides and can be used for analyzing of susceptibility, hazard, and risk. In this regard, the current research is trying to consider identified landslides size probability in the Pivejan Watershed, Razavi Khorasan Province. In the first step, lands...
متن کاملتشخیص سرطان پستان با استفاده از برآورد ناپارمتری چگالی احتمال مبتنی بر روشهای هستهای
Introduction: Breast cancer is the most common cancer in women. An accurate and reliable system for early diagnosis of benign or malignant tumors seems necessary. We can design new methods using the results of FNA and data mining and machine learning techniques for early diagnosis of breast cancer which able to detection of breast cancer with high accuracy. Materials and Methods: In this study,...
متن کاملNon-parametric kernel density estimation of species sensitivity distributions in developing water quality criteria of metals.
Due to use of different parametric models for establishing species sensitivity distributions (SSDs), comparison of water quality criteria (WQC) for metals of the same group or period in the periodic table is uncertain and results can be biased. To address this inadequacy, a new probabilistic model, based on non-parametric kernel density estimation was developed and optimal bandwidths and testin...
متن کاملStochastic Non-Parametric Frontier Analysis
In this paper we develop an approach that synthesizes the best features of the two main methods in the estimation of production efficiency. Specically, our approach first allows for statistical noise, similar to Stochastic frontier analysis, and second, it allows modeling multiple-inputs-multiple-outputs technologies without imposing parametric assumptions on production relationship, similar to...
متن کاملKernel Estimation in High-Energy Physics
Kernel Estimation provides an unbinned and non-parametric estimate of the probability density function from which a set of data is drawn. In the first section, after a brief discussion on parametric and non-parametric methods, the theory of Kernel Estimation is developed for univariate and multivariate settings. The second section discusses some of the applications of Kernel Estimation to high-...
متن کاملKernel Density Estimation for An Anomaly Based Intrusion Detection System
This paper presents a new nonparametric method to simulate probability density functions of some random variables raised in characterizing an anomaly based intrusion detection system (ABIDS). A group of kernel density estimators is constructed and the criterions for bandwidth selection are discussed. In addition, statistical parameters of these distributions are computed, which can be used dire...
متن کامل